Inferring long memory processes in the climate network via ordinal pattern analysis 
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We use ordinal patterns and symbolic analysis to construct global climate networks and uncover 
long and short term memory processes. The data analyzed is the monthly averaged surface air 
temperature (SAT field) and the results suggest that the time variability of the SAT field is deter- 
mined by patterns of oscillatory behavior that repeat from time to time, with a periodicity related 
to intraseasonal oscillations and to El Nino on seasonal-to-interannual time scales. 
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We analyze climatological data from a complex 
networks perspective, using techniques of non- 
linear time-series symbolic analysis. Specifically, 
we employ ordinal patterns and binary represen- 
tations to analyze monthly-averaged surface air 
temperature (SAT) anomalies. By computing the 
mutual information of the time-series in regular 
grid points covering the Earth's surface and then 
performing global thresholding, we construct cli- 
mate networks which uncover short-term mem- 
ory processes, as well as long ones (5-6 years). 
Our results suggest that the time variability of the 
SAT anomalies is determined by patterns of os- 
cillatory behavior that repeat from time to time, 
with a periodicity related to intraseasonal varia- 
tions and to El Nino on seasonal to interannual 
time scales. The present work is located at the 
triple intersection of three highly active inter- 
disciplinary research fields in nonlinear science: 
symbolic methods for nonlinear time series anal- 
ysis, network theory, and nonlinear processes in 
the earth climate. While a lot of effort is be- 
ing done in order to improve our understanding 
of natural complex systems, with many different 
methods for mapping time series to network rep- 
resentations being investigated and employed in 
complex systems such as the human brain, our 
work is the first one aimed at characterizing the 
global climate network in terms of oscillatory pat- 
terns that tend to repeat from time to time, with 
various time scales. By mapping these processes 
into a global network, using ordinal patterns and 
binary representations, we find that the struc- 
ture of the network changes drastically at differ- 
ent time scales. 



I. INTRODUCTION 

Complex networks have been intensively studied in the 
last years because they represent many real systems such 
as the Internet, ecological, social and metabolic networks, 
genes, cells and the brain Global climate modeling 
is also a hot topic nowadays because of its huge eco- 
nomic and social impact for future generations. Giving 
the complexity of the inter-relations between the differ- 
ent elements that constitute our environment, it is im- 
portant to analyze climatological data from a complex 
network perspective. However, despite the intensive ef- 
fort in research done in these two interdisciplinary and 
fascinating fields, just very few studies have combined 
both [2-8]. These studies have shown that network the- 
ory can yield light into interesting, previously unknown 
features of our climate. 

Tsonis and Swanson Q and Yamasaki, Gozolchiani 
and Halvin Q have shown that the climate network is 
significantly affected by El Nino, as during El Nino years 
many links of the network are broken. Tsonis and Swan- 
son [|| constructed cross-correlation-based networks of 
the SAT field for El Nino and for La Nina years and in- 
vestigated their structure. They found that the El Nino 
network possesses significantly fewer links and lower clus- 
tering coefficient and characteristic path length than the 
La Nina network. They conjectured and verified that, 
because El Nino network is less communicative and less 
stable than La Nina one, during El Nino years temper- 
ature predictability is lower compared to La Nina years. 
Using a different approach, Yamasaki, Gozolchiani and 
Halvin Q arrived at a similar conclusion. They devel- 
oped a method which allows to follow time variations 
of the network structure by observations of fluctuations 
in the correlations between nodes. The method allows to 
distinguish between the two qualitatively different groups 
of network links, blinking links that appear and disap- 
pear in a short time, and robust links that represent long 
lasting relations between temperature fluctuations in two 
regions. Assuming that broken links are due to structural 
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changes in the network, by tracking these changes in sev- 
eral zones a strong response to El Nino was reveled, even 
in geographical regions where the mean temperature is 
not affected by El Nino. 

Donges et al. [(| compared the structural properties of 
networks constructed by using, as a measure of dynami- 
cal similarity between regions, linear and nonlinear mea- 
sures: the linear Pearson correlation coefficient and the 
nonlinear mutual information. They analyzed two sets of 
data: the SAT anomalies obtained from large-scale cli- 
mate simulations by the coupled atmosphere-ocean gen- 
eral circulation models and the SAT anomalies reanalysis 
data sets. A high degree of similarity using the two ap- 
proaches (linear and nonlinear similarity measures) was 
found on the local and on the mesoscopic topological 
scales; however, important differences were uncovered on 
the global scale, particularly in the betweenness central- 
ity field. In Q Donges et al. employed the mutual in- 
formation to reveal wave-like structures of high-energy 
flow, that could be traced back to global surface ocean 
currents. Their results point to the major role of the 
oceanic surface circulation in coupling and stabilizing the 
global temperature field in the long-term mean. 

When computing the mutual information, in order to 
detect patterns and correlations in the variability of two 
nodes, a critical issue is defining probability distribu- 
tion functions (PDFs) that fully take into account the 
temporal order in which the SAT anomalies occur in the 
time-series. Histogram-based PDFs do not take into ac- 
count this temporal order, and thus, are not optimal for 
capturing subtle correlated oscillatory patterns. Alter- 
natively, one can use time-delay embedding techniques 
to represent the time series as a trajectory in a high- 
dimensional space; however, the information provided by 
the mutual information is strongly dependent on the em- 
bedding technique, the time-delay, and the phase space 
partition @. 

An alternative methodology, originally proposed by 
Bandt and Pompe (BP) allows to define probabil- 
ity distribution functions that fully take into account the 
time ordering of the SAT anomalies. The BP method 
is based on comparing values in the time-series to con- 
struct "ordinal patterns". By computing the PDF of 
the possible ordinal patterns, various information-theory 
quantifiers, such as the permutation entropy, the mutual 
information, complexity measures, etc. can be computed. 
The BP method has been successfully employed to ana- 
lyze time-series generated from physical, biological and 
social systems (see, e.g., ll| and references therein). 

When employing the BP methodology the precise val- 
ues of the SAT anomalies are neglected (as the method 
is based on comparing relative values in the time-series); 
however, as we will show, with the BP method one can 
identify patterns of oscillatory behavior that tend to re- 
peat from time to time, with various time scales. A 
drawback of the BP method is that, in order to cap- 
ture long memory processes, long time series are needed 
to compute the PDFs of the ordinal patterns with good 



statistics. The SAT data available (described in the next 
section) limited us to construct ordinal patterns of max- 
imum length 5, which allows to consider time-scales up 
to 5 years or 5 months. To overcome this limitation we 
employed "binary representations" , by which the time- 
series of SAT anomalies were transformed into sequences 
of Os and Is. These binary representations allowed to 
consider processes with longer time-scales, up to 6 years 
or 6 months. We will show in what follows that ordinal 
patterns and binary representations are tools that, when 
employed within a complex network perspective, are very 
powerful for the analysis of climatological data. By revel- 
ing long term and short term memory processes they pro- 
vide additional information to that obtained from con- 
ventional time-series analysis and thus they help to a 
better understanding of our complex climate. 

This article is organized as follows: Section II presents 
the description of the data analyzed and a summary of 
the methodology employed. Section III presents the re- 
sults obtained with ordinal patterns and binary represen- 
tations, and a comparison with the methodologies previ- 
ously employed by other authors (i.e., the linear cross- 
correlation [Jl and the nonlinear histogram-based mutual 
information [y|). Section IV contains a discussion of the 
results and the conclusions. 



II. DATA AND METHODS 

We present the analysis of the monthly averaged sur- 
face air temperature (SAT field, reanalysis data from the 
National Center for Environmental Prediction/National 
Center for Atmospheric Research, NCEP/NCAR [II]). 
As in [H, |H, |(| 0], anomaly values are considered (i.e., the 
actual temperature value minus the monthly average). 

The data covers a regular grid over the earth's sur- 
face with latitudinal and longitudinal resolution of 2.5°. 
These N = 10226 grid points are considered the nodes 
(or vertices) of a network (or graph), and the exis- 
tence of a link (or edge) between any two nodes de- 
pends on the "weight" of the link that measures the 
degree of statistical similarity between the climate dy- 
namics in those two nodes. The data covers the period 
January 1949-Dccember 2006, and therefore in each grid 
point i (i = 1 . . . N) we have M = 696 data points, 
{xi{t),t = 1...M}. W = {wij,i,j = 1...N} is the 
matrix that contains the weights that characterize the 
links between any two nodes. Since we don't attempt to 
uncover directionality in the couplings among the nodes, 
we will consider a symmetric measure of statistical simi- 
larity that results in symmetric weights. 

In [HQ these weights were quantified with the absolute 
value of the linear cross-correlation coefficient; in 0, 0], 
with the mutual information, a nonlinear measure that 
is a function of the probability density functions (PDFs) 
that characterize the time series in the two nodes, Pi(m) 
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and pj(n), as well as of the joint probability, pij(m,n), 

p %:j {m,n) 



W i3 = Mij = ^2pij(m, n) log 



Pi(m)pj(n) 



(1) 



The mutual information, which can also be written as 



Mi, 



Si + Sj S ij , 



(2) 



where Si = -J2Pi^°&Pi^ Sj = -J2Pj^°&Pj an d £y = 
— YlPij ]°EPij) indicates the amount of information of 
{xi(t)}, we obtain by knowing {xj(t)}, and vice versa. 
Mij measures the degree of statistical interdependence 
of the time series; if they are independent, pij(rn,n) = 
Pi(m)pj(n) and My = 0. 

To uncover correlated "patterns" of oscillatory behav- 
ior in the SAT anomalies, we employ the methodologies 
referred to as ordinal patterns and symbolic analysis, 
which arc based on comparing consecutive values in the 
time series, to compute the PDFs in Eq.(|TJ|. We begin 
by presenting the ordinal pattern methodology [Io| . 

First, in each grid point i, the time series {xj(t)} is 
divided into M — D overlapping vectors of dimension D. 
Then, each element of a vector is replaced by a number 
from to D — 1, in accordance with its relative magnitude 
in the ordered sequence (0 corresponding to the smallest 
and D — 1 to the largest value in each vector) . For exam- 
ple, with D = 3 the vector (vo,Vi,V2) = (6.8, 11.5, 1.1), 
gives the ordinal pattern 201 because i>2 < Vq < v\. In 
this way, each vector has associated an "ordinal pattern" 
(OP) composed by D symbols, and the symbol sequence 
comes from a comparison of neighboring values. Last, 
one computes the PDF of the Dl possible ordinal pat- 
terns. For example, with D = 3 the 3! = 6 different 
patterns are (012, 021, 102, 120, 201 and 210), and thus, 
the PDF is calculated with 6 bins. To have a good statis- 
tics one must have M — D » Dl (i.e., # of OPs in the 
time series >> # of possible OPs). 

Because in each time series we have M = 696 data 
points, to compute the PDFs with good statistics we limit 
to consider only D — 4 and D = 5. Ordinal patterns of 
D < 3 do not provide good resolution for computing 
the mutual information, Eq. [TJ because the PDFs are 
calculated with very few bins (for D = 6, there are only 
6 ordinal patterns, and thus, only 6 bins). 

With climatological data meaningful ordinal patterns 
can be formed either by comparing consecutive years or 
consecutive months. Specifically, if we use D = 3, when 
comparing consecutive years, the OPs in node i are de- 
fined by (xi(t),Xi(t + 12), Xi(t + 24)), t = l,...,M-24; 
when comparing consecutive months, they are defined by 
(x i (t),x i (t + l),x i (t + 2)), t= 1,...,M -2. 

To decide whether there is a link between two nodes, 
we perform global thresholding [li|, i.e., we define a 
threshold r (which is the same for all pairs of nodes) 
and assume that there is a link between i and j if the 
weight of the link is above the threshold, i.e., Wij > r. 

Clearly, a careful selection of the threshold is crucial 
for uncovering the backbone of the network Q . 



We use the following procedure: first, we check that we 
only take into account significant network connections. 
To do this we compute the weight matrix W from ran- 
domly shuffled time series in each node. The random 
elements of this 10226 x 10226 matrix have a very narrow 
PDF which, in principle, allows the use of the maximum 
matrix element, w m ax, as a significant limit. Then, we 
compute W with the original time series and consider 
that there is a significant link between the nodes i and 
j if Wij > w max , otherwise, we set = 0. While there 
are several methods to eliminate non-significant links and 
the evaluation of statistical significance is still an open 
problem (see, e.g., the discussion in Q), this procedure is 
computationally cost-efficient and we will show in what 
follows that allows to uncover meaningful climate net- 
works. The drawback is that it is a rather strong test 
that eliminates weak but significant links, and as a re- 
sults the networks tend to be very spare. The final step 
is to chose a threshold t to select the strongest links. 
As in Q, we chose r such that the resulting networks 
have a pre-determined number of links. In the follow- 
ing we present results for networks that have 1% of the 
total possible links (which will be referred to as "low- 
threshold" networks) and networks containing 0.1% of 
the total possible links (referred to as "high-threshold" 
networks). For easy comparison and to visualize the ef- 
fect of thresholding, we also present the networks con- 
taining all the significant links, which will be referred to 
as the "zero-threshold" networks. 

The networks are represented graphically as two- 
dimensional maps by plotting the area-weighted connec- 
tivity 0, H, H, ; which is the fraction of the total area of 
the earth to which each node i is connected, 



AWd = Sj 



N 



Aij cos(Ai) 



Ef cos(Aj) 



(3) 



1 if nodes 



where Xi is the latitude of node i and 
i and j are connected (i.e., if > r) and otherwise. 
The cosine terms correct for the fact that in a surface 
spherical network defined on a regular planar grid, the 
nodes correspond to regions of different area. 



III. RESULTS 
A. Ordinal Patterns Analysis 

The networks obtained when the ordinal patterns are 
defined by comparing SAT anomalies in consecutive years 
and in consecutive months are displayed in Figs. [T]|4] In 
each panel the values of the threshold, r, and of the edge- 
density, 



V w A ■ 
N(N - 1)' 



(4) 



arc indicated. 
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x = p = 0.027 1 = 0.227 p = 0.01 x = 0.504 p = 0.001 
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FIG. 1. Zero-threshold (left), low-threshold (center) and high-threshold (right) networks constructed by computing the mutual 
information from ordinal patterns of length D = 4 defined by comparing SAT anomalies in consecutive years. The 2D plots are 
color-coded such that the white (red) regions indicate the geographical areas with zero (largest) area weighted connectivity. In 
each panel the values of the threshold, r, and of the edge-density, p, are indicated. 

T = p = 0.006 1 = 0.674 p = 0.001 
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FIG. 2. Zero-threshold (left) and high-threshold (right) networks constructed by computing the mutual information from 
ordinal patterns of length D — 5 defined by comparing SAT anomalies in consecutive years. The 2D plots are color-coded 
such that the white (red) regions indicate the geographical areas with zero (largest) area weighted connectivity. The weaker 
links lose a bit of memory (compare the zero-threshold networks with D = 4 and D — 5) while the strong links do not, as the 
high-threshold networks are the nearly same for D = 4 and D = 5. 



For consecutive years the networks with D = 4, Fig. 
[T] and D = 5, Fig. [2] are very similar showing highest 
connectivity for the tropical region. 

For zero-threshold (left panels in Figs. [1] and [2]) the 
tropical Pacific shows the largest connectivity, particu- 
larly in the central and eastern side of the basin; the 
tropical Atlantic and Indian oceans follow. In the ex- 
tratropics there are patches of high connectivity off the 
western coast of Canada in the Northern Hemisphere 
(N.H.) and in the south Pacific in the Southern Hemi- 
sphere (S.H.). This connectivity structure is more pro- 
nounced for D = 5, although some of the weak links 
lose memory. These characteristics hint to El Nino as a 
fundamental player in setting up these connections flij ]. 
The El Nino phenomenon occurs on interannual time 
scales and consists in an anomalous warming of the east- 
ern equatorial Pacific. This warming in turn warms up 
the local atmosphere and influences other tropical regions 
through the excitation of equatorial Kelvin and Rossby 



waves. Changes in the precipitation associated with El 
Nino also induce stationary Rossby waves in the north- 
ern and southern extratropics that generate long range 
connections called atmospheric teleconnection patterns. 
Examples of these structures are the Pacific-North Amer- 
ican pattern that affects the northern Pacific and North 
America, and the Pacific-South American pattern that 
propagates in the southern Pacific toward South Amer- 
ica. These anomalous structures connect the tropical Pa- 
cific with remote locations and affect the local climate by 
changing, for example, the advection of heat or moisture 
into a region. 

For non-zero-thresholds (center and right panels in 
Figs. [T] and [2J only the strongest links remain and the 
networks clearly show again an El Nino-like structure in 
the tropical Pacific. Secondary maxima in the Indian and 
Atlantic oceans are present in the low-threshold networks 
but arc significantly weakened in the high-threshold ones. 
The continental regions have overall very low conncc- 
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x = p = 0.018 x = 0.186 p = 0.01 1 = 0.476 p = 0.001 




FIG. 3. As Fig. [T]but D — 4 ordinal patterns denned by comparing SAT anomalies in consecutive months. The 2D plots of the 
area weighted connectivity are color-coded such that the white (red) regions indicate the geographical areas with zero (largest) 
area weighted connectivity. 

x = p = 0.004 % = 0.656 p = 0.001 





FIG. 4. Zero-threshold (left) and high-threshold (right) networks constructed by computing the mutual information from 
ordinal patterns of length D = 5 defined by comparing SAT anomalies in consecutive months. The 2D plots of the area 
weighted connectivity are color-coded such that the white (red) regions indicate the geographical areas with zero (largest) area 
weighted connectivity. 



tivity which translates in the low predictability of sur- 
face temperature anomalies on interannual time scales. 
Within these continental regions, the largest connectiv- 
ity is seen over Asia and North America, the latter max- 
imum being perhaps due to the Pacific North American 
pattern induced by El Nino . 

The networks obtained when the ordinal patterns are 
defined by comparing temperature anomalies in consecu- 
tive months, Figs. [31 HI present, for zero- and low- thresh- 
old, similar features as for consecutive years, although the 
networks are more homogeneous. There is a maximum 
in the equatorial Pacific, a secondary maximum in the 
Indian ocean and extratropical maxima over Asia, North 
America and southern subtropics. On the other hand, 
the high threshold network shows that the strongest links 
are located in the extra tropics. We speculate that this 
could be a result of the modulation of the temperature 
variance by the seasonal cycle, which is strongest over the 
northern hemisphere continental masses and has a mini- 



mum in the tropical band. This network structure is also 
seen when using ordinal patterns formed by 5 consecutive 
months, Fig. |4j 



B. Binary representations 

To capture longer memory processes one should use 
larger D values; however, for D = 6 there are 6! = 720 
possible ordinal patterns, and since we have time series 
with less than 700 data points, there is not enough data 
to calculate ordinal patterns PDFs with good statistics. 

As discussed in the introduction, a solution to over- 
come this problem is employing "binary representations" , 
by which the time series {xi(t)} is transformed into a 
sequence {vi(t)} of 0s and Is, using the following rule: 
Vi(t) — if Xi(t) > and vi(t) = 1 otherwise (since 
the Xi values are temperature anomalies, we are taking 
into account whether the SAT field is above or below its 
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FIG. 5. As Fig. [TJbut employing binary representation. Zero-threshold (left), low-threshold (center) and high-threshold (right) 
networks constructed by computing the mutual information from patterns of length D — 4 defined by comparing SAT anomalies 
in consecutive years. The 2D plots are color-coded such that the white (red) regions indicate the geographical areas with zero 
(largest) area weighted connectivity. In each panel the values of the threshold, r, and of the edge-density, p, are indicated. 

t = p = 0.021 t = 0.331 p = 0.01 x= 0.566 p = 0.001 
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FIG. 6. As Fig. [5] but with D = 5. 



monthly averaged value). We can then define "binary 
patterns" of dimension D (e.g., for D = 3 the possible 
patters are 000, 001, 010, 100, 011, 110, 101 and 111) 
and compute their PDF. The number of different pat- 
terns is 2 D , and thus, we can calculate PDFs of patterns 
of D = 6 (2 6 = 64) with good enough statistics. Patterns 
with D < 3 do not provide good resolution for computing 
the mutual information, Eq. [1] because the PDFs are cal- 
culated with very few bins. Therefore, in the following, 
we consider D = 4, 5, and 6. 

Figures IMTOl present the results when the binary pat- 
terns are defined by consecutive years and months. 

For consecutive years, Figs. [SKI the networks obtained 
when using binary representations are very similar to 
those found with ordinal patterns. The tropical regions 
are quite uniformly well connected (although a Pacific 
maximum is clear) while the extratropics show localized 
regions of high connectivity likely due to atmospheric 
teleconnections forced from the tropics, particularly for 
low density networks. 

The networks obtained for consecutive months, Figs. 
[51ITU1 show that as the threshold or as D increases there 
are overall similar changes in structure as those seen for 



ordinal patterns, Figs. |3l [4] for short memory processes 
(or for low threshold) the maximum connectivity is in 
the tropics, while for longer time scales (or for higher 
threshold) the extratropics show largest number of links. 
As discussed before, this could be the result of the mod- 
ulation of the temperature variance by the seasonal cy- 
cle, which would be the main process that connects grid 
points in very low density networks (grid points that are 
connected by very strong links). Our results could also 
hint at the role of land surface conditions like snow or soil 
humidity in increasing the persistence of surface temper- 
ature anomalies over the northern hemisphere continents. 
Overall, these results agree well with the fact that tem- 
perature teleconnections from the tropical Pacific tend to 
last no much longer than a season in the different parts 
of the world. 

As a way to test the interpretation of the above pre- 
sented results, in terms of the symbolic methodology of 
time-series analysis capturing two different time-scales of 
the Earth's climate, seasonal and intcrannual, we con- 
structed binary patterns of fixed dimension, D — 6, that 
cover three different time intervals: 
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FIG. 7. As Fig. [5] but with D — 6. Comparing with Figs. [5] and [6] one can see that there is a good agreement with the results 
obtained previously with ordinal patterns: the weaker links tend to lose memory, while the strongest links do not. 
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FIG. 8. As Fig[5]but the networks constructed with binary representation comparing anomalies in D = 4 consecutive months. 
The 2D plots of the area weighted connectivity are color-coded such that the white (red) regions indicate the geographical 
areas with zero (largest) area weighted connectivity. 



i) covering one- year, the patterns are composed as 
[H (t), Xi(t + 2), n (t + 4), . . . , Xi (t + 10)] ; 

ii) covering two-years, the patterns are composed as 
[xi(t),Xi(t + 4), Xi(t + 8), . . . , Xi(t + 20)]; 

ii) covering three- years, the patterns are composed as 

[ Xi (t), x t (t + 6), Xi(t + 12), . . . ,Xi(t + 30)]. 

The results are presented in Fig llll were one can see how 
the network changes. For a time interval of one year the 
extra tropics have the largest number of links and there 
are very few in the tropical region. On the other hand, 
for a time interval of three years, the extra-tropics keep 
about the same number of connections while in tropi- 
cal Pacific "El Nino" stands out (note the different color 
scales in the panels in Figll 1|) . In summary, confirming 
the results previously found with ordinal patterns and bi- 
nary representations composed by consecutive years and 
by consecutive months, on intra-seasonal time scales the 



extra tropical connections dominate, while on interan- 
nual scales, the "El Nino" is the key player in setting up 
teleconnections worldwide. 

As it was previously discussed, the significance of the 
network links was tested in comparison with links com- 
puted from surrogate time-series in each node. Since sur- 
rogate data does not preserve the autocorrelation proper- 
ties of the original time-series, to further test the validity 
of the previously presented results, we did the following 
test: we computed the mutual information using the orig- 
inal time series in node i and the time-inverted series in 
node j. The results show no significant spatial structure 
in the area weighted connectivity plots (not shown). 



C. Comparison with other measures 

It is interesting to compare the results obtained us- 
ing ordinal patterns and binary representations with 
those obtained using conventional techniques of time- 
series analysis, as the linear cross-correlation coefficient 
(as in Ref. Q) and the mutual information, computing 
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FIG. 9. As Fig. H] but with D = 5. 
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FIG. 10. As Fig. M but with D = 6. 



the PDFs from standard histograms of amplitude values 
(as in Refs. 0, 0])- Figure [12] displays the zero, low 
and high threshold networks when the weights are cal- 
culated with the absolute value of the cross-correlation 
coefficient coefficient and Fig. [T3l when they are calcu- 
lated with mutual information, with the PDFs calculated 
from histograms of temperature anomaly values. In this 
case the PDFs were computed employing 32 bins and in 
each time-series the values of the SAT anomalies were 
re-normalized such that each time-series has zero mean 
and standard deviation equal to one (asinRef. [1,0, for 
easier comparison). 

The 2D plots of the area weighted connectivity are sim- 
ilar to those previously reported in Ref. [f| and also, 
to those seen in Figs. [U [H El [HI where the ordinal 
patterns and the binary representations are formed by 
comparing consecutive years. "El Nino" is the main fea- 
ture uncovered. There are also regions with relatively 
high number of links in the northern hemisphere conti- 
nents and southern subtropics, but the high connectiv- 
ity in the extratropics seen previously in Figs. G2 [H [H 
[HI and [TU] is not observed. In other words, employing 
the cross correlation coefficient or the histogram-based 



mutual information uncovers mainly the interannual net- 
work. These methodologies fail to separate the two dis- 
tinct time-scales (intra-scasonal and inter-annual) that 
are clearly seen when using symbolic analysis and the 
time series are transformed in sequences of patterns by 
comparing consecutive years or consecutive months. 



IV. CONCLUSIONS 

Concluding, we have shown that ordinal patterns and 
symbolic analysis applied to anomalies of the surface air 
temperature are powerful tools for the analysis of the 
large-scale topology of the climate network. The success 
of these methods is based on an appropriate partition 
of the phase space that results in ordinal patterns and 
binary representations having PDFs that characterize the 
diversity of patterns present in the climate dynamics. 

A main advantage of the methodology proposed here 
is that by varying the dimension of the pattern and the 
year-month comparison, one can uncover memory pro- 
cesses with different time scales. We found that both, 
monthly and yearly patterns reveal long memory pro- 
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FIG. 11. High-threshold networks constructed with binary representations, with patterns of D = 6 covering time-intervals of 
one year (left), two years (center) and three years (right). The 2D plots of the area weighted connectivity are color-coded such 
that the white (red) regions indicate the geographical areas with zero (largest) area weighted connectivity. See text for details. 
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FIG. 12. Zero-threshold network (left) and non-zero-threshold networks (center and right) constructed by estimating the weights 
with the absolute value of the cross-correlation coefficient. The 2D plots of the area weighted connectivity are color-coded such 
that the white (red) regions indicate the geographical areas with zero (largest) area weighted connectivity. 



cesses, and that depending on the time scale considered 
the climate network can change completely. 

The fact that ordinal patterns and symbolic analysis 
give meaningful information indicates that the time vari- 
ability of the anomaly SAT field is strongly determined 
by patterns of oscillatory behavior that tend to repeat 
from time to time. 

Overall we found that on seasonal time-scales the ex- 
tratropical regions, mainly over Asia and North Amer- 
ica, present the strongest links while in interannual time 
scales, the tropical Pacific clearly dominates. 
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